A B C D E F G H I J K L M N O P Q R S T U V W X Y Z All
Karpagam, K.
- Enhanced Association Rule Mining Algorithm to Extract High Utility Itemsets from a Large Dataset
Authors
1 H.H. The Rajah's College (Autonomous), Pudukkottai, Tamil Nadu, IN
2 Dept. of Computer Applications, Karpaga Vinayaga College of Engineering and Technology, Kancheepuram, Tamil Nadu, IN
Source
Data Mining and Knowledge Engineering, Vol 7, No 7 (2015), Pagination: 238-241Abstract
Data mining aims at bringing out the hidden information from a large data set using data mining techniques according to the requirements. Association rule mining identifies itemsets that occur frequently in data set and frames association rules by taking all items equally. But many differences exist among the items that play a vital role in decision making. By taking one or more values of items as utilities, the utility mining technique works on finding the itemsets with greater utilities. In the proposed paper we present a utility mining algorithm named IUM (Improved Utility Mining) algorithm that finds high utility itemsets and also low utility itemsets from a large data set and the experiments states that the proposed algorithm performs better than existing algorithms in case of running time.Keywords
Association Rules, Frequent Itemsets, Low Utility Itemset, High Utility Itemset.- Comparative Analysis of Optimization Algorithms for Document Clustering
Authors
1 Department of Master of Computer Application, Dr. Mahalingam College of Engineering & Technology, Pollachi, IN
2 Department of Computer science and Engineering, Institute of Road and Transport Technology, Erode., IN
Source
Data Mining and Knowledge Engineering, Vol 9, No 6 (2017), Pagination: 120-125Abstract
Document clustering or text clustering is an unsupervised technique and it is used to grouping the documents of same context. Document clustering algorithms are widely used in web searching engines to produce results relevant to a query. Today, the information in websites is growing in huge size and it leads to the process of managing, retrieve the required and updated information is a tedious task. Also necessary to obtain the exact information required by the user from the documents. Recently optimization algorithms are introduced and are applied to the clustering algorithms. The Genetic Algorithm and Cuckoo Search algorithms are meta-heuristic optimization algorithms and are used to obtain the optimum solutions. In this paper, Genetic Algorithm and Cuckoo Search algorithm based Domain-specific Keyword Similarity based Knowledgebase Creation algorithm are proposed to optimize the document clustering to answers the question answering system. The experimental were conducted on benchmark datasets and the performance was analyzed in terms of Precision, Recall, F1, Missrate, Fallout and Purity.
Keywords
Cuckoo Search, Document Clustering, Genetic Algorithm, Information Processing Knowledge Base, Text Mining.References
- J.H. Holland, Adaptation in natural and artificial systems: an introductory analysis with applications to biology, control, and artificial intelligence. MIT press, 1975.
- Y.Xin-She, S.Deb, “Cuckoo search via lévy flights”, World Congress on Nature & Biologically Inspired Computing, NaBIC, , 2009,pp. 210–214.
- X.S.Yang and S.Deb, "Engineering Optimization by Cuckoo Search", J. Mathematical Modeling and Numerical Optimization, vol. 1, no. 4, 2010.
- K.Karpagam and A.Saradha, ”An Improved Question Answering System Using Domain Context Specific Document Clustering with Wordnet”, International Journal of Printing, Packaging & Allied Sciences, 2016, Volume 4, No. 5, Pages 3257 -3265
- H.Yang, T.Chua, S.Wang, C.Koh, ”Structured use of external knowledge for event- based open domain question answering“, In Proceedings of the annual international ACM SIGIR conference on research and development in information retrieval ACM, 2003, pp. 33–40.
- J.Jeon, W.Croft, and J.Lee, “Finding semantically similar questions based on their answers”, In Proceedings of the annual international ACM SIGIR conference on research and development in information retrieval, 2005.
- K.Iman, S.A.Mohammad, “Genetic programming-based feature learning for question answering”, Elsevier- Information Processing and Management, 2016.
- T.Ming, D.S.Cicero, X.Bing and Z. Bowen, Improved Representation Learning for Question Answer Matching”,, Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics, August, 2016,pp. 7-12.
- P.Pathak, M.Gordon, and W.Fan, “Effective information retrieval using genetic algorithms based matching functions adaption,” in Proc.33rd Hawaii International Conference on Science (HICS), Hawaii, USA, 2000.
- E.Abdessamad, H.Ulf, H.Eduard, M.Daniel, M.Eric, and R.Deepak, “How to Select Answer String”, Springer Netherlands, 2006.
- A.Mansaf, S.Kishwar, Web Search Result Clustering based on Cuckoo Search and Consensus Clustering”, Indian Journal of science and Technology,Volume 9, Issue 15, April, 2016.
- C.Cobos, H.M.Collazos, R.U.Munoz, M. Medoza, E.Leon and E.H.Veidema, “Clustering of web search results based on cuckoo search algorithm and balanced Bayesian information criterion”, Information Sciences,.2014,pp. :248- 264.
- J. Sethilnath, V. Das, S.N. Omkar, and V. Maniv, “Clustering using Levy flight cuckoo search”, Proceedings of Seventh International Conference on Bio-Inspired Computing: Theories andApplications,BIC-TA, 2012.
- S.Liu, F.Liu, C.Yu, and W. Meng, “An effective approach to document retrieval via utilizing WordNet and recognizing phrases", In Proceedings of the annual international ACM SIGIR conference on research and development in information retrieval (pp.266–272), ACM, 2004.
- Voorhess, Ellen, Graff, and David,” AQUAINT-2 information retrieval text research collection LDC2002T25, Web Download. Philadelphia, Linguistic data consortium 2008.
- M.Saeedeh, K.Dietrich, “Bridging the vocabulary gap between questions and answer sentences”, Elsevier- Information Processing and Management, 2015.
- S.Gunnar et al, “Setting Goals and Choosing Metrics for Recommender System Evaluations”, 5th ACM Conference on Dresden University of Technology Recommender Systems,Chicago, 2011.
- Heie, H. Matthias, Whittaker, W.D.Edward and S.Furui, “Question answering using statistical language modeling”,Computer Speech and Language, 26, , 2012 pp. 193–209.
- http://qwone.com/~jason/20Newsgroups/
- Graff, David, “The AQUAINT corpus of English News Text”, LDC2002T31, Web Download. Philadelphia, Linguistic data consortium, 2002.